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ABSTRACT 

An ideal image compression technique for image data archival, retrieval and distribution would b; 
one with the asymmetrical computational requirements of Vector Quantization (VQ), but without the 
complications arising from VQ codebooks. Codebook generation and maintenance are stumbling blocks 
•which have limited the use of VQ as a practical image compression algorithm. Model-based YQ 
(MVQ), a variant of VQ described here, has the computational properties of VQ but does not require 
explicit codebooks. The codebooks are internally generated using mean removed error and Human 
Visual System (HVS) models. The error model assumed is the Laplacian distribution with mean, 
computed from a sample of the input image. A Laplacian distribution with mean, X, is generated with a 
uniform random number generator. These random numbers are grouped into vectors. These vectors are 
further conditioned to make them perceptually meaningful by filtering the DCT coefficients from each 
vector. The DCT coefficients are filtered by muldplying by a weight matrix that is found to be optimal 
for human perception. The inverse DCT is performed to produced the conditioned vectors for th: 
codebook. The only image dependent parameter used in the generation of codebook is the mean, X, that 
is included in the coded file to repeat the codebook generation process for decoding. 

Keywords: Image data compression, vector quantization, model-based vector quantization, image data 
archival and retrieval. 

1. INTRODUCTION 

We have found Vector Quantization (VQ) to be an attractive lossy compression technique for image 
data archival, retrieval and distribution 1 due to its asymmetric computational property'. While VQ 
compression can be computationally demanding, decompression of the VQ coded data is a 
computationally light table lookup process. This means less computational burden on the user 
compared to some compression techniques that are characterized by symmetric computational 
requirements such as JPEG/DCT 2 . Thus VQ is ideal for archive and distribution, where images are 
compressed once at the archiving center and potentially decompressed many times at different remote 
sites. 

The performance of VQ depends heavily on the quality of codebooks employed. The codebook 
generation involves a computationally intensive training process. One of the popular training algorithms 
for VQ codebook generation is the Generalized Lloyd Algorithm (GLA) 3 . See 4 - 5 for other training 
algorithms. The training process is carried out on a set of image data called the training data set. The 
codebook thus generated provides optimal performance (in the rate distortion sense) for a given size of 
the codebook for all the images included in the training set. However, for images outside this training 
set the performance is suboptimal. By carefully selecting the training set, however, an acceptable level 
of performance can be achieved. 

Codebook size is an important issue -in optimal performance of VQ. The compression ratio (CR- 
defined as the ratio of number of bits in the input image to the number of bits in the compressed image) 
for VQ techniques decreases as the logarithm of the codebook size and distortion (normally measured as 
mean squared error between the input image and reconstructed image) decreases linearly with increasing 
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xcodebook size 6 . Therefore, it is important to consider large codebooks for better rate-distortion 
-performance of the YQ. The large codebooks, however, require large computational requirements for 
fcodins. We have used a parallel full search implementation of YQ to solve the computational problems 
Jarisins from large codebooks 7 . Further, since the codebooks involved are large, we cannot afford to 
H include codebook in even,' image that we compress to decode. Thus, generation and maintenance of 
?codebooks pose some significant problems for YQ to be a viable option for data archival, retrieval and 
j distribution. 

VQ would be a more attractive approach for image data archival, retrieval and distribution if it did 
foot require the explicit generation and maintenance of codebooks. We are thus faced with the question 
sof whether we can develop a variant of VQ that does not explicitly depend on the use of codebooks, 
if Our work in this direction has resulted in what we call Model-Based VQ (MVQ). MVQ is based on 
^generating mean removed residual vectors using error models such as Laplacian. The error values are 
Jgenerated using uniform random number generator and shaped according to Laplacian distribution by 
Hosing an inverse function for filtering. The generated error values are then normalized to -1.0 to 1.0. 
ITbe error values thus obtained are independent and identically distributed (i.i.d.). Since we are trying to 
[model actual error residuals from the source which are not independent, we need to process these error 
IjvaJues to impose the actual error characteristics. In our previous work 8 on MVQ we computed the 
^covariance matrix of the mean removed vectors from the source and factorized it into lower and upper 
\ triangular matrices. The lower triangular matrix was used to impose correlations among the i.i.d. 
a Laplacian numbers. This model was called Laplacian Multivariate Model (LMV). The vectors 
1 generated from this model constituted a codebook, which was used to VQ encode the source image. 
Tnis codebook can be completely specified by the seed point used for generating the uniform random 
numbers from -1.0 to 1.0, X (the mean of the Laplacian distribution), and finally the lower triangular 
^matrix (L) . These parameters are included in the coded file of the source image and so the decoder can 
; generate the codebook that is used for coding at the decoding side from the mode] parameters and using 
’the block means along with indexes to the codebook entries that matched the vectors derived from the 
# source image. The results obtained using this model were reasonable, but were not as good as VQ for 
® some NASA images (Thematic Mapper data). 

s? We propose here that the model can be improved further by perceptually weighting the residual 
1' vectors from the model. In the remaining sections we describe MVQ that uses the perceptual weighting 
.of the model generated residual vectors and provide results on set of NASA Earth science images. We 
also compare these results with VQ results on the same image data set. 


2. MODEL-BASED VQ ALGORITHM 


H The MVQ algorithm implicitly generates its codebook using mathematical models based on a few 
® image specific parameters from the source which are transmitted along with the coded image data so 
: that decoding can be performed by generating the same codebook at decoding end. The model 

f simulates the mean removed vectors of the source. The means (block means) from the source, m, of 
vector Xj, are scalar random variables given by 


I m i = 5 / k Sx ij (1) 

j 

§ where x- is j* element of the vector Xj. For all the vectors, we compute the vector means and transmit 
~ them after lossless compression such as JPEG/DPCM or Ziv-Lempel algorithm 10 . The second phase of 
the algorithm is to send residual information in compressed form. The residual vector for 1 th block 
I composed of scalar random variables is given by . - . 
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^rforrnance of the VQ. The large codebooks, however, require large computational requirements for 
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*^ s ini from large codebooks 7 . Further, since the codebooks involved are large, we cannot afford to 
'"-lude codebook in ever}' image that we compress to decode. Thus, generation and maintenance of 
-oj ebooks pose some significant problems for VQ to be a viable option for data archival, retrieval and 
distribution. 

VQ would be a more attractive approach for image data archival, retrieval and distribution if it did 
not require the explicit generation and maintenance of codebooks. We are thus faced with the question 

whether we can develop a variant of VQ that does not explicitly depend on the use of codebooks. 
Q-jr work in this direction has resulted in what we call Model-Based VQ (MVQ). MVQ is based on 
renerating mean removed residual vectors using error models such as Laplacian. The error values are 
cenerated using uniform random number generator and shaped according to Laplacian distribution by 
using an inverse function for filtering. The generated error values are then normalized to -1.0 to 1.0. 
Tne error values thus obtained are independent and identically distributed (i.i.d.). Since we are trying to 
model actual error residuals from the source which are not independent, we need to process these error 
values to impose the actual error characteristics. In our previous work 8 on MVQ we computed the 
covariance matrix of the mean removed vectors from the source and factorized it into lower and upper 
triangular matrices. The lower triangular matrix was used to impose correlations among the i.i.d. 
Laplacian numbers. This model was called Laplacian Multivariate Model (LMV). The vectors 
cenerated from this model constituted a codebook, which was used to VQ encode the source image, 
this codebook can be completely specified by the seed point used for generating the uniform random 
numbers from -1.0 to 1.0, X (the mean of the Laplacian distribution), and finally the lower triangular 
matrix (L) . These parameters are included in the coded file of the source image and so the decoder can 
generate the codebook that is used for coding at the decoding side from the model parameters and using 
the block means along with indexes to the codebook entries that matched the vectors derived from the 
source image. The results obtained using this model were reasonable, but were not as good as VQ for 
some NASA images (Thematic Mapper data). 

We propose here that the model can be improved further by perceptually weighting the residual 
vectors from the model. In the remaining sections we describe MVQ that uses the perceptual weighting 
of the model generated residual vectors and provide results on set of NASA Earth science images. We 
also compare these results with VQ results on the same image data set. 

2. MODEL-BASED VQ ALGORITHM 

The MVQ algorithm implicitly generates its codebook using mathematical models based on a few 
image specific parameters from the source which are transmitted along with the coded image data so 
that decoding can be performed by generating the same codebook at decoding end. The model 
simulates the mean removed vectors of the source. The means (block means) from the source, mj of 
vector Xj, are scalar random variables given by 

m i = V k Sx i j (1) 

j 

where xy is j* element of the vector Xj.. For all the vectors, we compute the vector means and transmit 
them after lossless compression such as JPEG/DPCM or Ziv-Lempel algorithm 30 . The second phase of 
the algorithm is to send residual information in compressed form. The residual vector for i* block 
composed of scalar random variables is given by 
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Rj = Xj - mj U 

where U is a unit vector of same dimension as input vector Xj. In MVQ, the residual vector, R n 
represented by an index I to the model generated codebook (CB) entries: 

MVQ: -> \ ( 3) 

In our implementation, the codebook size (nc, number of entries in the codebook) is 16,384 resulting j n 
14 bit representation for the index, I. The compression ratio (CR) that can be obtained from this schem: 
is given by 


k 

CR = 

b + log2(nc) 


At the decoding end, 


% = CB(I) 


( 5 ) 


where, 5^ is an approximation of R v The reconstructed vector at the decoding end is given by 

= nij + ^ (6) 

The distortion between input vector, X, and the reconstructed vector, X\ is expressed in terms of mean 
squared error (MSE). The MSE for all the vectors drawn from the source image is given by 


D = ££(x ir ^) 2 (7) 

i 3 

The important element of the MVQ is the simulation of residual vectors of the source image and 
building the codebook of 16,384 codevectors from this process. We note why we settled for 16,384 
vectors in the section (below) on implementation on the MasPar. 

The input image is decomposed into square blocks of 4x4, 5x5, etc., depending on the targeted 
compression ratio. From Eq. 4, for a given compression ratio, one can compute the vector size, k(rx c ), 
where r is the number of rows and c is the number of columns of the block of pixels from the source 
image data. For each block of pixels, the block mean is computed, compressed using appropriate 
lossless compression, and transmitted. The residua] vectors are computed by removing the mean from 
the vector formed by raster scan of the image blocks. The residua] vector elements are used to compute 
the mean assuming the residual vector elements fit into a Laplacian distribution. 


P(r) = 


_! g-lxlA. 

IX 


( 8 ) 


where X is the mean of the distribution. The mean is given by 



( 9 ) 
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■^re a e is the standard deviation of the residua] vector elements treated as i.i.d. random variables. 
X r computing X from the residual vectors of the image, it is used to generate Laplacian distribution 
uniform random number generator. If Uj is the i 1 ^ uniform random number (value -1.0 to 1.0) 
" aerated by uniform random number generator, then the random number that forms is Laplacian 
('....bution with mean equal to X is given by 

Vj = -X log £ (1-Uj) (10) 


or - sign is randomly given to vj. 

S umber of vj’s generated depend on the codebook size and vector size. For a vector size of k and 
:,->aebook size of nc, the number of vj’s is equal to the product of k and nc. The random numbers are 
grouped by k elements to form nc vectors that in turn form a codebook. These random numbers are 
independent and identically distributed, whereas the residual vector elements show considerable 
correlations among them which are manifest in nonzero off diagonal elements of covariance matrix. 
Tne most apparent solution to tuning the codebook to actual source is to impose correlations on the 
vector elements of the codebook so that they have approximately same covariance as the source residual 
vectors. 

In our earlier work 8 we decomposed the covariance matrix of the mean removed residual vectors of 
the source into upper and lower triangular matrices and mapped all the codebook entries onto lower 
triangular matrix to impart the cross correlation structure of the input source to randomly generated 
numbers grouped into vectors. Let X s be the covariance matrix of the source mean removed residual 
vectors (of size kxk). By using the Cholesky’s decomposition, Z s is decomposed into lower (L) and 
upper (U) triangular matrices. The lower triangular matrix is used to condition the i.i.d. of the codebook 
vectors as follows. 


q = Lq (ID 

Eq. 1 1 was applied to all the vectors of the codebook. The rate distortion results from this algorithm 
were reasonable, but not comparable to training VQ. The question that faced us was how to improve the 
codebook to get better performance. At this point we brought in the Human Visual Systems (HVS). 
Our conjecture was that the residual vectors generated should be perceptually meaningful for better 
performance. 

2.1 The Role of HVS in the Model Codebook 

The HVS system has been incorporated in JPEG/DCT with excellent results. In JPEG/DCT, the 
DCT coefficients in are quantized based on their significance to human perception. Human visual 
weighted DCT coefficients have been used for progressive image transmission by Chitprasert and Rao 9 
with visually more pleasing results than unweighted DCT coefficients. In order to impose HVS 
properties on codebook vectors generated by model, the vectors have to be DCT transformed and 
weighted using the HVS DCT weight matrix and then inverse transformed. The HVS DCT weight 
matrix we have used for model generated residual vectors is shown in Fig. 1. Note that we have 
modified the coefficient weighting matrix in 9 by making the DC term 0 so that it can be applied to 
residual vectors with 0 DC value. 
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Figure 1: The Human Visual Systems weight function of DCT coefficients. 


The model codebook using HVS is generated as follows. A set of (&*nc) uniformly distributed 
random numbers {u;} are generated and are nonlinearly mapped to { Vj} according to Eq. 10 to fit into 
Laplacian distribution of specified mean A, which is computed from the input source as discussed. Next 
the {vj} are grouped to form set of nc k-element vectors { Cj }. The vectors then undergo DCT 
transformation as given below. 


C x dct = DCT ( Cj ) ( 12 ) 

The next step is to weight each coefficient of and take the inverse DCT transform. 

q = IDCT( C dct * W ) (13) 

where * stands for element by element product of the two matrices. If Cj is smaller than 8x8 block 
(vector size less than 64), the corresponding subset of the W matrix are used in the product. The 
codebook containing { Cj } are now replaced by {£)} and the resulting codebook produces much better 
rate-distortion performance. 


3. IMPLEMENTATION ON THE MASPAR 

MVQ is an asymmetrical algorithm like VQ. While decoding is a table look-up process that can be 
performed quite efficiently on a sequential machine, coding using full search is computationally veiy 
intensive. Others solve VQ coding computational problems by structuring the codebook as tree°. 
However, the price of such structuring is suboptimal performance. We instead solve the computational 
problem by using an implementation on a massively parallel computer to obtain optimal performance 
(in the rate-distortion sense) for a given codebook size by doing full search on the codebook for best 
match of the source residual. 

We have implemented MVQ coding on Goddard Space Flight Center’s 16,384 processor MasPar 
MP-2. Each processing element (PE) has a local memory of 64 Kbytes. We generate 16,384 vectors of 
using the algorithm given in the preceding section and load each vector in each of the 16,384 PEs. We 
chose 16,384 vectors because we have found that this size codebook gives good performance, and 
because this size is most efficient on a 16,384 PE MasPar. The input image is decomposed into blocks 
of 4x4 or 6x6 or 8x8 pixels depending on the rate-distortion requirements and the block means are 
computed on the MasPar by propagating each vector from the image to one PE and the mean removed 
residuals from the input image are computed in parallel. The mean of the vectors are stored in a 
separate file and compressed using a JPEG's lossless compression 2 technique. The input residual vector 
elements are normalized with respect to maximum residual vector element value from the mean 
removed source. This can be accomplished by using MasPar's min, max functions that take time 
proportional to the product of number of bits per pixel and clock cycle of the MasPar. Now, the 
normalized residual vectors are coded one by one by propagating each one to all PEs. The distance 
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^•ween the normalized residual vector from the input source and the codebook entries can be computed 
* V. min distance can be found by using min function of the Maspar. The time taken for the best 
Z"ch search from the codebook takes time proportional to the number of bits in the Euclidean distance 
n vm between source residual vector and codebook entry' (32 bits). 

If the codebook size is smaller than the PE array size by a factor of 2 or its multiple, simple load 
Silencing techniques can be used to gain speedups for smaller codebooks. For example, if the codebook 
j S half the array size, then each codebook entry can be split up into two smaller codebook entries 
,V-; n containing half the number of elements and loaded into local memories of the adjacent PEs. The 
:“ ul residual vector from the source is similarly split into two smaller vectors and propagated such that 
•':r>t half of the vector is placed in PEs with even address and second half in those w'ith odd address PEs. 
7ne distances are then computed in each PE separately and combined by shift operations to find the 
closest matching vector. Codebooks larger than PE array size by factor of 2 or its multiples can be 
handled by using processor virtualization. 

In decoding, w j e retrieve the X from coded file and generate the codebook using the X and the DCT 
coefficient weighting function given in Fig. 1. After the codebook is generated from X, the rest of the 
process of decoding is a simple table lookup process. From the coded file, index for each residual 
Vector is retrieved and the corresponding residual vector is read from the codebook. When the block 
mean is added to this residual vector, the block is reconstructed. 

4. RESULTS AND DISCUSSION 

The MVQ algorithm was tested on some Landsat images. Results are shown for Washington, DC 
downtown image of Thematic Mapper (TM) image of band 4. The original image is shown in Fig. 2a. 
The compressed and reconstructed images are shown in Fig. 2b, c, and d for different compression ratios 
(or different vector sizes). The results are comparable to the more standard VQ in which the codebook 
:s generated by training on a set of images which belong to the same class as the image to be coded. 
The rate distortion curves (Mean Squared Error Vs. Compression Ratio) are showm in Fig. 3 for both VQ 
and MVQ. As can be seen from the figure, the MVQ performance compares very closely with that of 
VQ and for larger compression ratios, it even performs better. The computational burden of computing 
the DCT of the codebook entries at encoding and decoding is a constant term. For large images, the 
constant term for decoding becomes negligibly small. For large images, the decompression of MVQ 
coded images is much better than JPEG/DCT compressed images. The future research in MVQ is to 
address multispectral image compression and also to study the decoding time performance for large 
images and compare the results with JPEG/DCT compression technique. 
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Figure 2. MVQ results on a 512x512 pixel Landsat Thematic Mapper image from over Washington, 
DC. (a) Original image, (b) Result from MVQ compression and reconstruction with a 4x4 vector size 
(CR = 6.5. MSE = 10.5). (c) Result from MVQ compression and reconstruction with a 6x6 vector size 
(CR = 11 . 1 . MSE = 26.0). (d) Result from MVQ compression and reconstruction with a 8x8 vector size 
(CR = 27.4, MSE = 38.7). In the lower right portion of each image a 20x20 pixel section of the image 
from the area of the 14th Street Bridge over the Potomac River is blown up by a factor of 8. 
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